Robust Speech Recognition using Vocal Tract Normalization for Emotional Variation
Authors
Abstract
Similar Resources
Vocal Tract Length Normalization for Large Vocabulary Continuous Speech Recognition
Generally speaking, the speaker dependence of a speech recognition system stems from speaker-dependent speech features. Variation in vocal tract length and/or shape is one of the major sources of inter-speaker variation. In this paper, we address several methods of vocal tract length normalization (VTLN) for large vocabulary continuous speech recognition: (1) explore the bilinear warping VTL...
Dynamic Vocal Tract Length Normalization in Speech Recognition
A novel method to account for dynamic speaker characteristic properties in a speech recognition system is presented. The estimated trajectory of a property can be constrained to be constant or to have a limited rate-of-change within a phone or a sub-phone state. The constraints are implemented by extending each state in the trained Hidden Markov Model by a number of property-value-specific sub-...
Efficient Vocal Tract Normalization in Automatic Speech Recognition
In this paper we study the effect of vocal tract normalization (VTN) on the word error rate (WER) in speaker-independent large vocabulary speech recognition. Evaluation test results are reported for the German VerbMobil II (VM II) and the English Wall Street Journal (WSJ) corpus. In particular, we analyse: the effect of the type of warping function (linear vs. non-linear) on the WER; different met...
Augmented Cepstral Normalization for Robust Speech Recognition
We proposed an augmented cepstral mean normalization algorithm that differentiates noise and speech during normalization and computes a different mean for each. Compared with standard cepstral mean normalization, the new procedure reduced the error rate slightly for same-environment testing and by 25% when an environmental mismatch exists.
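The idea of computing a separate mean for noise and for speech can be illustrated with a minimal sketch. This is not the algorithm from the paper: the abstract does not say how frames are classified, so the hard energy threshold, the function name `augmented_cmn`, and its parameters are all assumptions made for illustration only.

```python
def augmented_cmn(cepstra, energies, threshold):
    """Subtract a class-specific cepstral mean from each frame.

    cepstra:   list of frames, each a list of cepstral coefficients
    energies:  one energy value per frame (assumed speech/noise cue)
    threshold: frames with energy >= threshold count as speech (assumption)
    """
    is_speech = [e >= threshold for e in energies]
    dim = len(cepstra[0])

    def class_mean(flag):
        # Mean cepstral vector over all frames of one class.
        frames = [c for c, s in zip(cepstra, is_speech) if s == flag]
        if not frames:
            return [0.0] * dim  # empty class: nothing to subtract
        return [sum(f[k] for f in frames) / len(frames) for k in range(dim)]

    speech_mean = class_mean(True)
    noise_mean = class_mean(False)

    normalized = []
    for c, s in zip(cepstra, is_speech):
        mean = speech_mean if s else noise_mean
        normalized.append([x - m for x, m in zip(c, mean)])
    return normalized


# Two low-energy (noise) frames followed by two high-energy (speech) frames.
frames = [[1.0, 2.0], [3.0, 4.0], [10.0, 20.0], [14.0, 24.0]]
result = augmented_cmn(frames, energies=[0.1, 0.2, 5.0, 6.0], threshold=1.0)
```

Standard cepstral mean normalization would subtract one mean computed over all four frames; here each class is centered on its own mean, so the proportion of noise frames in an utterance does not shift the normalized speech frames.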
Efficient Cepstral Normalization For Robust Speech Recognition
In this paper we describe and compare the performance of a series of cepstrum-based procedures that enable the CMU SPHINX-II speech recognition system to maintain a high level of recognition accuracy over a wide variety of acoustical environments. We describe the MFCDCN algorithm, an environment-independent extension of the efficient SDCN and FCDCN algorithms developed previously. We compare th...
Journal
Journal title: Journal of Korean Institute of Intelligent Systems
Year: 2009
ISSN: 1976-9172
DOI: 10.5391/jkiis.2009.19.6.773